Words containing a comma:
Rank in Wordlist | Frequency | Word |
---|---|---|
5793 | 31 | 1,5 |
6475 | 27 | 2,5 |
6662 | 26 | 1,2 |
8594 | 19 | 3,5 |
9907 | 16 | 1,6 |
10431 | 15 | 1,3 |
10455 | 15 | 4,5 |
11654 | 13 | 1,8 |
11678 | 13 | 2,3 |
11691 | 13 | 7,5 |
Words containing a closing parenthesis:
Rank in Wordlist | Frequency | Word |
---|---|---|
39529 | 2 | .) |
Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
3560 | 54 | 20% |
4015 | 47 | 30% |
4085 | 46 | 10% |
4249 | 44 | 100% |
4250 | 44 | 50% |
4968 | 37 | 80% |
5485 | 33 | 15% |
5488 | 33 | 60% |
5797 | 31 | 5% |
5945 | 30 | 40% |
Words containing &:
Rank in Wordlist | Frequency | Word |
---|---|---|
11729 | 13 | CD&V |
14458 | 10 | S&P |
20518 | 6 | H&M |
33952 | 3 | R&D |
61056 | 1 | AT&T |
65600 | 1 | Blood&Honour |
66729 | 1 | Business&Decision |
70840 | 1 | D&G |
70990 | 1 | DJIBOUTI&CO |
73075 | 1 | E&Y |
Words containing $:
Rank in Wordlist | Frequency | Word |
---|---|---|
11012 | 14 | $ US |
57134 | 1 | $ CA |
57956 | 1 | 133$US |
58520 | 1 | 19G$US |
58952 | 1 | 23,55$US |
59401 | 1 | 33,05$US |
59407 | 1 | 33,92$/b |
59436 | 1 | 34,38$US |
59775 | 1 | 45$US |
60164 | 1 | 60$US |
Words containing a double quote:
Rank in Wordlist | Frequency | Word |
---|---|---|
318 | 553 | ." |
Words containing an apostrophe:
Rank in Wordlist | Frequency | Word |
---|---|---|
100 | 1503 | d'un |
113 | 1374 | d'une |
134 | 1160 | c'est |
147 | 1055 | qu'il |
156 | 990 | C'est |
203 | 795 | n'est |
204 | 795 | s'est |
259 | 641 | n'a |
560 | 324 | d'euros |
609 | 300 | qu'elle |
Words containing +:
Rank in Wordlist | Frequency | Word |
---|---|---|
57473 | 1 | 1+1 |
58155 | 1 | 16+1 |
59884 | 1 | 5+1 |
60699 | 1 | 90+1 |
60700 | 1 | 90+3 |
60716 | 1 | 90e+1 |
76163 | 1 | GMT+1 |
79096 | 1 | Huber+Suhner |
96937 | 1 | S+P |
99761 | 1 | Sports+Blaton |
Words containing /:
Rank in Wordlist | Frequency | Word |
---|---|---|
3293 | 59 | https://www |
4055 | 47 | km/h |
5857 | 31 | et/ou |
12832 | 12 | https://t |
14187 | 10 | 24h/24 |
14481 | 10 | Telbec/ |
15402 | 9 | CNW/ |
22693 | 5 | 1/2 |
23116 | 5 | FIGAROVOX/TRIBUNE |
26183 | 4 | 2016/2017 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words; others will not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
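The check described above can be sketched in a few lines of Python. This is only an illustration: the function name `flag_suspicious`, the choice of forbidden characters, and the threshold `min_freq=10` are assumptions made for the example, not values taken from the report. The sample rows are copied from the tables in this section.

```python
from typing import Iterable, List, Set, Tuple


def flag_suspicious(
    rows: Iterable[Tuple[str, int]],
    forbidden: Set[str],
    min_freq: int,
) -> List[Tuple[str, int]]:
    """Return (word, freq) pairs that contain a forbidden character
    and occur at least min_freq times (i.e. are not very rare)."""
    return [
        (word, freq)
        for word, freq in rows
        if freq >= min_freq and any(ch in forbidden for ch in word)
    ]


# (word, frequency) pairs taken from the tables in this section.
rows = [("d'un", 1503), ('."', 553), ("20%", 54), ("CD&V", 13), (".)", 2)]

# Hypothetical language setting: parentheses and double quotes are
# treated as forbidden inside words, the apostrophe is allowed.
forbidden = set('()"')

suspicious = flag_suspicious(rows, forbidden, min_freq=10)
print(suspicious)  # [('."', 553)]
```

With these assumed settings only `."` is flagged: `.)` also contains a forbidden character, but its frequency of 2 falls below the threshold.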
Example query for words containing %:
SELECT w_id - 100, freq, word FROM words WHERE w_id > 100 AND word LIKE "%\%%" LIMIT 10;
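The same query can be repeated for every character in the list. The following is a minimal sketch using Python's `sqlite3` module with an in-memory table. The table layout (`w_id`, `word`, `freq`), the helper names `like_pattern` and `words_containing`, and the `ORDER BY w_id` clause are assumptions for illustration. SQLite needs an explicit `ESCAPE` clause for the backslash, which differs from the query above. The `w_id - 100` offset mirrors the original query, and the sample rows reuse ranks and frequencies from the tables in this section.

```python
import sqlite3

# Special characters listed in this subsection.
SPECIAL_CHARS = [",", "(", ")", "%", "&", "$", '"', "'", "+", "*", "=", "/", "_"]


def like_pattern(char: str) -> str:
    """Build a LIKE pattern matching words that contain `char` literally.
    % and _ are LIKE wildcards, so they are escaped with a backslash."""
    escaped = char.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    return f"%{escaped}%"


def words_containing(conn: sqlite3.Connection, char: str, limit: int = 10):
    """Return (rank, freq, word) rows for words containing `char`."""
    sql = (
        "SELECT w_id - 100 AS wl_rank, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (like_pattern(char), limit)).fetchall()


# Hypothetical in-memory wordlist; w_id is rank + 100, as implied by the query.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)")
conn.executemany(
    "INSERT INTO words VALUES (?, ?, ?)",
    [
        (200, "d'un", 1503),
        (3660, "20%", 54),
        (4115, "30%", 47),
        (4155, "km/h", 47),
        (11829, "CD&V", 13),
    ],
)

for char in SPECIAL_CHARS:
    hits = words_containing(conn, char)
    if hits:
        print(f"Words containing {char}:")
        for wl_rank, freq, word in hits:
            print(f"  {wl_rank} | {freq} | {word}")
```

Passing the pattern as a bound parameter avoids quoting problems for characters such as `"` and `'`, and escaping `_` keeps it from matching every word.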
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots